Shape invariant pitch modification of speech using a harmonic model
نویسندگان
چکیده
We present a simple but e ective approach to pitch modi cation of speech based on a harmonic model. Building on our time-scaling algorithm [1], pitch modi cation applies to a harmonically coded glottal wave estimate derived via a simple inverse ltering technique [3]. The modi ed glottal wave subsequently serves as input to an LPC vocal tract lter and the pitch-scaled speech is generated. Shape invariance is maintained in the glottal wave by exploiting the harmonic nature of the sine waves used to code each frame thus avoiding the need for \pitch pulse onset time" estimation. Furthermore, given its smooth shape it is not necessary to resample the glottal wave spectrum at the new harmonic frequencies. The original spectrum is merely compressed/expanded to produce the desired pitch change.
منابع مشابه
Shape invariant time-scale modification of speech using a harmonic model
A new and simple approach to shape invariant timescale modi cation of speech is presented. The method, based upon a harmonic coding of each speech frame, operates entirely within the original sinusoidal model [3] and makes no use of \pitch-pulse onset times" used by conventional algorithms. Instead, phase coherence, and thus shape invariance, are ensured by exploiting the harmonic relation exis...
متن کاملHigh-Quality Speech Modification Based on Pitch- Synchronous Harmonic and Non-harmonic Modeling of Speech
In this paper, we propose a high-quality speech modification method based on pitch-synchronous harmonic and non-harmonic modeling of speech. In the proposed method, the harmonic and non-harmonic parts of speech are modeled by the sum of sinusoids with frequencies corresponding to pitch multiples and with randomized frequencies, respectively. Then, harmonic and nonharmonic parts are synthesized ...
متن کاملEnhanced shape-invariant pitch and time-scale modification for concatenative speech synthesis
To preserve shape-invariance when pitch or time-scale modifying sinusoidally modelled voiced speech, the phases of the sinusoids used to model the glottal excitation are made to add coherently at estimated excitation points. Previous methods achieve this by estimating excitation phases at synthesis frame boundaries, disregarding the frequency modulation that may occur between the frame boundary...
متن کاملAn implementation and evaluation of two diphone-based synthesizers for Turkish
This paper presents two diphone based Turkish text-to-speech systems; the first system is realized inside the MBROLA project, a freely available multilingual speech synthesizer and the second system is based on shape invariant harmonic modeling. Both synthesizers use the same parametric representations of two diphone databases (male, female) obtained by processing speech data with a pitch async...
متن کاملRobust HNR-Based Closed-Loop Pitch and Harmonic Parameters Estimation
An important problem in speech coding framework is model parameters estimation. In most cases parametric speech coding methods do not preserve shape of speech waveform. This fact implies straightforward parameters estimation and analysisby-synthesis method is hardly used. A novel analysis-by-synthesis parameters estimation method in speech coders based on harmonic models presented. We introduce...
متن کامل